Dependence Models for Searching Text in Document Images
Similar resources
Searching in Document Images
Searching in scanned documents is an important problem in Digital Libraries. If OCRs are not available, the scanned images are inaccessible. In this paper, we demonstrate a searching procedure without an intermediate textual representation. We achieve effective retrieval from document databases by matching at word-level using image features. Word profiles, structural features and transform doma...
Keyword Searching in Compressed Document Images
A huge number of document images are accessible on the Internet and in digital libraries. We find that most of them are packed in PDF files and compressed using the CCITT Group 4 standard to save storage space and speed up transmission. It is therefore worthwhile to develop methods for searching keywords directly in these documents. In this paper, we present a compressed patter...
Text Line for Historical Document Images
In this paper we present a new approach for text line segmentation that works directly on gray-scale document images. Our algorithm constructs a distance transform directly on the gray-scale images, which is used to compute two types of seams: medial seams and separating seams. A medial seam is a chain of pixels that crosses the text area of a text line and a separating seam is a path that passes...
Text line extraction for historical document images
0167-8655/$ see front matter 2013 Elsevier B.V. All rights reserved. http://dx.doi.org/10.1016/j.patrec.2013.07.007 Corresponding author at: Department of Computer Science, Triangle Research & Development Center, Kafr Qarea, Israel. Fax: +972 4 6356168. Authors: R. Saabni, A. Asi, J. El-Sana. These authors contribut...
Inversion Detection in Text Document Images
OCR makes it possible for the user to edit or search a document's contents. In this paper we describe a special water fill technique for detecting upside-down text documents. Each character has upside and downside filling capacities. A character may have a filling capacity on two sides, on one side, or none at all. The total upside and downside capacities for the scanned page calculat...
Journal
Journal title: IEEE Transactions on Pattern Analysis and Machine Intelligence
Year: 2019
ISSN: 0162-8828,2160-9292,1939-3539
DOI: 10.1109/tpami.2017.2780108